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AFLP-based method for integrating physical and genetic maps 

Field of the invention 
The present invention pertains to the field of biotechnology, more in particular to 
the field of genomic mapping and more in particular to the field of linking physical and 
genetic markCTs and to the field of integrating physical and genetic m£5)s. The invention 
5 pertains to a method for the integration of physical and genetic maps, more in particular to 
the high throughput building of integrated genetic and physical maps. 

Background of the invention 
Integrated genetic and physical genome maps are extremely valuable for map-based 

1 0 gene isolation, comparative genome analysis and as sources of sequence-ready clones for 
genome sequencing projects. The effect of the availabiUty of an integrated map of physical 
and genetic markers of a species for genome research is enormous. Integrated maps allow 
for precise and r^id gene mapping and mapping of all microsatellite loci and other 
application such as SNA marker based gene manipulation. Various methods have been 

15 developed for assembling physical maps of genomes of varying complexity. One of the 
better characterised approaches use restriction enzymes to generate large numbers of DNA 
fragments firom genomic subclones (Brenner et al, Proc. Natl. Acad. Sci., (1989), 86, 
8902-8906; Gregory a/.. Genome Res. (1997), 7, 1162-1168; Marrae/a/., Genome Res. 
(1997), 7, 1072-1084). These fingerprints are compared to identify related clones and to 

20 assemble overlapping clones in contigs. The utility of fingerprinting for ordering a complex 
genome is limited, however, due to variation in DNA migration fix>m gel to gel, the 
presence of repetitive DNA sequences, unusual distribution of restriction sites and skewed 
clone representation. Moreover, fingerprinting alone, unless combined with other methods, 
does not link genomic clones directly to genetic maps. Therefor most high quality physical 

25 maps of complex genomes have been constracted using a combination of fingCTprinting 
and PCR-based or hybridisation based methods. 

Selective restriction fragment amplification or AFLP is known, for instance firom 
the European patent ^pUcation 0 534 858 and US 6,045,994 by appUcant and from an 
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article by Vos et al Nucleic Acids Research (1995), 23, 4407-4414, incoiporated herein by 
reference. In general, AFLP comprises the steps of: 

(a) digesting a nucleic acid, in particular a DNA or a cDNA, with one or more specific 
restriction endonucleases, to fragment said DNA into a corresponding series of 

5 restriction fragments; 

(b) ligating the restriction fragments thus obtained wdth at least one double-stranded 
synthetic oligonucleotide adapter, one end of which is compatible with one or both 
of the ends of the restriction fragments, to thereby produce tagged restriction 
jfragments of the starting DNA; 

10 (c) contacting said tagged restriction fi-agments under hybridising conditions with at 

least one oligonucleotide primer; 

(d) amplifying said tagged restriction fragments hybridised with said primers by PGR 
or a similar technique so as to cause further elongation of the hybridised primers 
along the restriction fragments of the starting DNA to which said primers 

15 hybridised; and 

(e) identifying or recovering the amplified or elongated DNA fragment thus obtained. 
The ampUfied DNA-fragments thus obtained can then be analysed and/or 

visualised, for instance by means of gel-electrophoresis. This provides a genetic fingerprint 
showing specific bands corresponding to the restriction fragments which have been linked 

20 to the adapter, have been recognised by the primer, and thus have been amplified during the 
amphfication step. The fingerprint thus obtained provides information on the specific 
restriction site pattern of the starting DNA, and thus on the genetic make-up of the 
organism from which said DNA has been derived. 

AFLP can therefore be used to identify said DNA; to analyse it for ttie presence of 

25 specific restriction site patterns, restriction fragment length polymorphisms OEiFLPs) and/or 
specific genetic markers (so-caUed "AFLP-markers"), which may be indicative of the 
presence of certain genes or genetic traits; or for similar purposes, for instance by 
comparing the results obtamed to DNA-samples of known origin or restriction pattern, or 
data thereon. AFLP is eminently suited to characterise genetic markers by means of one or 

30 more of the AFLP firagments fims visualised. 
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The primCTs used in AFLP are such that they recognise the adapter and can serve as 
a starting point for the polymerase chain reaction. To this end, the primers must have a 
nucleotide sequence that can hybridise with (at least part of) the nucleotide sequence of the 
adapter adjacent to the 3' end of the restriction fragment to be amplified. The primers can 
5 also contain one or more further bases (called "selective bases") at the 3'-end of their 
sequence, for hybridisation with any complementary base or bases at the 3'-end of the 
adapter ligated restriction Segment. Located between the part of the primer that hybridises 
to the adapter and the selective bases that hybridise to the restriction fragment, the primer 
may contain a section that is capable of hybridising to the remains of the restriction site. 
1 0 Thus, in general an AFLP primer has the following structure: adapter complementary part- 
restriction site remains complementary part-selective bases. The ad^ter complementary 
part- restriction site remains complementary part is generally depicted as the 'constant 
sequence' of the AFLP primer and the selective bases as the 'variable sequence'. 

As, of all the adapter-ligated restriction fragments present in the mixture, only those 
1 5 fragments that contain bases complementary to the selective bases will subsequently be 
amplified, the use of these "selective" primers will reduce the total amount of bands in the 
final fingerprint, thus making the fingerprint more clear and more specific. Also, the use of 
different selective primers (i.e. different variable sequence) will generally provide different 
fingerprints, which can also be used as a tool for the purposes of identification or analysis. 
20 The selective nucleotides are complementary to the nucleotides in the adapter- 

Ugated restriction fiagments that are located adjacent to the constant primer sequence. 

Primers containing selective nucleotide are denoted as +N primers, in which N 
stands for the number of selective nucleotides present at the 3'-end of the primer. N is 
preferably selected from amongst A, C, T or G. 
25 N may also be selected fiiom amongst various nucleotide alternatives, i.e. 

compounds that are capable of mimicking the behaviour of ACTG-nucleotides but in 
addition thereto have other characteristics such as the c^abiKty of inq)roved hybridisation 
compared to the ACTG-nucleotides or the capability to modify the stability of the duplex 
resulting from the hybridisation. Examples thereof are peptide nucleic acids (PNAs), 
30 locked nucleic acids (LNAs), mosine etc. When the ampUfication is performed with more 
than one primer, such as with PGR using two primers, one or both primers can be equipped 
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witii selective niujleotides. Hie numbar of selective nucleotides may vary, depending on the 
species or on other particulars determinable by the skilled man. In general the number of 
selective nucleotides is not more than 10, but at least 5, preferably 4, more preferably 3, 
most preferred 2 and especially preferred is 1 selective nucleotide. 
5 A +1 primer thus contains one selective nucleotide, a +2 primer contains 2 selective 

nucleotides etc. A primer with no selective nucleotides (i.e. a conventional primer) can be 
depicted as a +0 primer (no selective nucleotides added). When a specific selective 
nucleotide is added, this is depicted by the notion +A or +C etc. 

By amplifying a set adapter Ugated restriction fragments witti a selective primer, a 

1 0 subset of adapter-ligated restriction fragments is obtained, provided that tiie 

complementary base is present at the J5)propriate position in the restriction fragment Using 
a +1 primer, for example, the complexity (and the number of visualised fragments) of the 
amplifi^ mixture is reduced by a fector 4 compared to a amplification with a non-selective 
primer (a +0 primer) Higher reductions can be achieved by using primers with multiple 

15 selective nucleotides, i.e. 16 fold reduction of the original multiplex ration is obtained with 
2 selective nucleotides etc. 

As AFLP provides amplification of both strands of a double stranded starting 
DNA, AFLP advantageously allows for exponential amplification of the fragment, i.e. 
according to the series 2, 4, 8, 16, etc.. Also, AFLP requires no prior knowledge of the 

20 DNA sequence to be analysed, nor prior identification of suitable probes and/or the 
construction of a gene library from the starting DNA. 

For a fiirther description of AFLP, its advantages, its embodiments, as well as the 
techniques, enzymes, adapters, primers and fiirther compounds and tools used therein, 
reference is made to EP-0 534 858, and to Vos et al Nucleic Acids Research (1995), 23, 

25 4407-4414 botii pubUcations are incorporated herein by reference. Also, in tiie description 
hereinbelow, the definitions given in paragraph 5.1 of EP-0 534 858 will be used, unless 
indicated otherwise. 

The potaitial of AFLP as a technology for tiie integration of physical and genetic 
m^s has been recognised before. Klein et al in Genome Research, (2000), 10, 798-807 
30 have described the use of AFLP in the integration of physical and genetic m^s of 

Sorghum. The method of Klein et al. comprises generating AFLP fingerprints using +3/+3 
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selective primers of all individual BAG clones in a libraiy. The method furflier comprises 
the generation of pools of clones. The pools are also analysed by AFLP fingerprinting 
albeit using differoit restriction enzymes and otherwise other circimistances compared to 
the generation of the AFLP fingerprints of all the individual BACs. The use of different 
5 enzyme combinations renders the method difficult. The method of Klein et al. although 
feasible, is complex and laborious and comes with several other disadvantages. One of 
them is that the method is vmsuitable for positioning non-polymorphic markers on the 
integrated map. 

It is a goal of the present invention to provide for an improved method for the 
10 integration of physical and genetic vaaps. It is a fiirther goal of the present invention to 

provide for an improved method based on AFLP. It is yet a further goal of the invention to 
provide for a high througjiput method for the integration of physical and genetic maps that 
results in m^s of improved quaHty such as measured by increased marker densities or as a 
result of more reliable contig generation. 

15 

Description of the invention 
The present invention pertains to methods for the integration of physical and 
genetic markers and for the integration of physical and genetic maps. The present invention 
overcomes many of the previously encountered problems in the art and provides for 

20 significant improvements and advantages that will become clear through this description, 
the claims and the examples. The method is based on AFLP and more in particular based 
on the use of combinations of selective primers in AFLP. The method results in the 
integration of genetic and physical m^s and the linking of physical and genetic markers. 
The invention further pertains to the use of AFLP primers in the methods of the invention 

25 and to a set of structurally defined AFLP primers. 

Detailed description of flie invention 
In one aspect the invention pertains to a method for the integration of physical and 
genetic maps by associating a restriction firagment, preferably an AFLP fiagment with a 
30 genetic marker, providing a Ubrary of clones containing inserts of the (part of) genome of 
interest, pooling the clones into pools and fingerprinting the pools using AFLP primer-pairs 
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of higher selectivity, individually fingerprinting the clones in a library using AFLP primer- 
pairs of lower selectivity. The pools tihiat contain a fiiagment corresponding to the fragment 
associated with the genetic marker are identified and the individual clones containing the 
Augment corresponding to the fragment associated with the genetic marker are identified 
5 and linked. From the fingerprint patterns of the individual clones containing the fi-agment 
associated with the genetic marker a contig is generated. The contig is linked to the genetic 
marker on the genetic map and the process is repeated for all genetic markers, thereby 
providing an integrated physical and genetic map. 

The notion *primer-pairs of higher/lower selectivity' as used hCTein relates to the 
10 number of selective nucleotides in the AFLP primers used for fingerprinting. Thus a +0/-K) 
primer-pair is of a lower selectivity than a +0/-I-1 primer-pair or a +1/+0 primer-pair. 
Similarly a +3/+4 primer-pair is of a higher selectivity than a +2/+2 primer-pair etc. 

In one aspect the invention pertains to a method for linking a genetic and a physical 
genome map comprising tihie steps of: 

1 5 (a) providing individual genetic markers, preferably in the form of a genetic 

map; 

(b) characterising each of the genetic markers by means of at least one AFLP 
fi-agment identified through AFLP fingerprinting; 

(c) providing an artificial chromosome library such as a BAG or YAC; 
^0 (d) generating a multitude of pools, each pool containing a multitude of 

artificial chromosomes fi-om the library; 

(e) generating an AFLP fingerprint for each of the pools; 

(f) selecting firom the multitude of pools the individual pools in which the at 
least one AFLP firagment associated with a genetic marker is present in the 

^5 fingerprint of the individual pool; 

(g) generating an AFLP fingerprint of the individual artificial chromosomes 
firom the pools identified under (f); 

(h) selecting firom the fingerprints of the individual artificial chromosomes of 
step (g) the individual artificial chromosome(s) in which the AFLP firagment 

to associated with the genetic marker is present in the fingerprint; 

(i) generating a contig of the individual artificial chromosomes identified in 
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step (h); 

0) linki ng the contig obtained in step (i) to liie genetic marker on the genetic 
map, thereby obtaining a linkage betsveen the physical marker and the 
genetic marker; 

5 (k) repeating steps (g-j) for all genetic markers and ahgn the contigs obtained to 

thereby obtain an integrated physical and genetic map; 

wherein the forward and reverse AFLP primers used in step (b) and (f) comprise K 
respectively L selective nucleotides at the 3 '-end of the primer, wherein the forward and 
reverse AFLP primers used in step (d) comprise M respectively N selective nucleotides at 
10 the 3'-end of the primer, wherein K, L, M, N are integers from 0 to 10, and wherein K+L ^ 
M+N. 

The method of the invention is flexible. Flexibility in this respect refers to the 
starting material. The method of the invention may start from whole genomes of any 
species, but is also appUcable to only a part of tihe genome or a selected region of the 

1 5 genome or a chromosome or part thereof. 

The method starts by using individual genetic markers. It is preferred that these 
markers have been integrated on a genetic map, but this is not a pre-requisite. The method 
of the inventions is equally well suited to link physical markers to genetic markers, after 
which the physical markers are aligned to from a physical map and at the same time 

20 provide a genetic map. The genetic map may comprise only one siagle genetic marker but 
preferably the map comprises two or more genetic markers. Each of the genetic markers is 
characterised by an AFLP fragment or by a combination of AFLP fragments. The second 
step is the provision of a library of artificial chromosomes. The library can be a Bacterial 
Artificial Chromosome library (BAG) or based on Yeast Artificial Chromosome (YAC). 

25 Other libraries such as based on Cosmids, PAC, TAC or MAC are also possible. Preferred 
is a BAG library. The library is preferably of a high quality and preferably is a high insert 
size genomic library. This means that the individual BAG contains a large insert of the 
genomic DNA under investigation. The size of the preferred large insert is species- 
dependent. Throughout this £^plication reference is made to BACs as examples of artificial 

30 chromosomes. However, it is noted ttiat the present invention is not limited thereto and that 
other artificial chromosomes can be used without departing from the gist of the invention. 
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Preferably the libra3ties contain at least five genome equivalents, more preferably at least 7, 
most preferably at least 8. Particularly preferred is at least 10. The higher the number of 
genome equivalents in the library the more reliable is the resulting contig and the integrated 
maps. 

5 The individual clones in the library are pooled to form pools containing a multitude 

of artificial chromosomes or clones. The pooling may be the simple combination of a 
number of individual clones into one sample (for example, 100 clones into 10 pools of 10 
clones), but also more elaborate pooling strategies maybe used. The distribution of the 
clones over the pools is preferably such that each clone is present in at least two or more of 
10 the pools. The pools are generated based on pooling strategies well known in the art. The 
skilled man is capable selecting the optimal pooling strategy based on factors such as 
genome size etc. The resulting pooling strategy will depend on the circumstances, and 
examples thereof are plate pooling N-dimensional pooling such as 3D-pooling, 6D-pooling 
or complex pooling. 

1 5 Each of the pools is fingerprinted using AFLP and preferably with primer-pairs 

having a high selectivity to thereby generate fingerprints that contain a Umited number of 
bands or firagments. Preferably the primer-pairs used in the fingerprinting of the pools are 
the same primer-pairs that are used to identify the AFLP firagment characteristic of the 
genetic markers in step (b). The primer-pairs with a high selectivity contain more selective 

20 nucleotides than the primer with a low selectivity and are for instance +3/+3 primer-pairs 
or +2/+3 primer-pairs. 

The fingerprints of the pools are subsequentiy screened for the presence or absence 
of the firagments of interest that are associated with the genetic marker(s). From a pool (or 
pools) in which the desired fragment has been found present, each of the individual clones 

25 in that pool (or pools) is fingerprinted using AFLP. The individual clones are preferably 
fingerprinted with primer-pairs having a low selectivity to thereby generate fingerprints 
that contain a large number of bands or fragments that later aid in the generation of reliable 
contigs, hi general in this method, the AFLP fingerprints of the genetic markers, the pools 
and the individual clones are preferably obtained using the same combinations of 

30 en2yme(s) and adapters. Preferably only the primers are different and preferably otily the 
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selective bases are different in number and/or in type. The primer-paiis with a low 
selectivity are for instance +0/+0 primer-pairs or +1/+0 primCT-pairs. 

The fingerprints of the individual clones that have been obtained are now subjected 
to a selection step whereby those Augments are selected that are most likely connected to 
5 the genetic marker of interest. This is achieved by subjecting the individual clones 
containing the desired firagment to a selection step, preferably in siUco. 

In this selection step, the fingerprinting patterns of the individual clones are 
compared. When fingerprinting patterns are compared that are derived from clones with 
overlapping inserts, the respective fingerprints will contain one or more bands/fragments 

10 that are similar in place, intensity, type etc. (See for example Fig 1 ., the bands are marked 
as #,'^ $). With fingerprint patterns that are derived &om clones that do not have 
overl^ping inserts flie chance that one or more bands ejq)ress such a similarity is 
significantly lower. When, based on the fingerprint patterns, it is likely that the associated 
inserts of the clones are overlapping inserts, alignment in sihco results in the generation of 

15 a contig. This contig is then linked to the genetic marker and the genetic marker can be 

positioned on the genetic map. Repeating these steps for various genetic markers ultimately 
results in positioning of all contigs on the genetic map and in the integration of the physical 
and genetic map. 

The advantage of the fingerprinting of the clones using primers with low selectivity 
20 is that many fragments are amplified. Consequently, a large number of physical markers are 
identified. Typically 4-5 times more scorable fragments are generated with a +0/+) 
ampUfication compared to +2/+3 amplification and leads consequently to a 4 or 5 times 
higher resolution of the contigged clones. In this way, a high-resolution physical map is 
generated. This is advantageously in the generation of contigs and it will reduce the number 
25 of singletons that may occur in the generation of contigs. 

The term 'contig' as used herein refers to a continuous sequence of DNA that has 
been assembled fix>m overlapping cloned DNA firagments, following the definition 
provided in ITie Encyclopedia of Molecular Biology (1994. Blackwell). 

In general, the primers of low and high selectivity are characterised in that the 
30 forward and reverse AFLP primers of high selectivity used in the fingerprinting of the 
individual clones and in tiie identification of the AFLP fragment associated with tiie 
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genetic marker comprise K respectively L selective nucleotides at the 3'-end of the primer, 
v^erein the forward and reverse AFLP primers used in the fingerprinting of the individual 
clones from the library comprise M respectively N selective nucleotides at the 3'-end of the 
primer, wherein K, L, M, N are integers from 0 to 10, and wherein K+L ^+N. Thus the 
5 primers of high selectivity contain at least one selective nucleotide more than the primers 
of low selectivity. 

Alignment is performed by comparing band patterns and intensities of the 
fingerprint to identify overlapping patterns. Clones containing overiapping patterns overlap 
can be aligned based thereon. Due to the use of AFLP with low selective primers, the 
10 physical markers present in the clones are abundant and hence the aJigmnent thereof can be 
performed with great accuracy. The contig can be generated using any means known in the 
art The clones can be aligned using software well known in the art for these purposes such 
as FPC (Soderlund C, L Longden, R. Mott, 1997, FPC: a system for building contigs from 
restriction fingerprinted clones. Comput. Applic. Biosd. 13:523-535). This software can be 

15 used in the method of the present invention with cut-off parameters that are between 10'^ - 
10-^^ , preferably between lO"*' - lO"^* . more preferably between 10"^ - 10'" , in particular 
between 10"^ - lO^" , more preferably between 10"' - 10"" , and most preferred 10''°. Cut- 
off is a parameter that that represents the threshold value representing ihe maximum 
allowable probability of a chance match between any two clones. Tolerance is a parameter 

20 that is a measure for the maximum distance or bandwidth which two bands from two 
different clones can differ and still be considered the same band. In this invention, 
tolerance is rated from 0 (identical) to 5 (difference of 0.5 base pair). Preferably, tolerance 
is as low as possible (i.e. 0), but in practice a value of 3-4 (difference of 0.3-0.4 base pair) 
is acceptable and 1-2 is preferred. 

25 The generated contigs are then linked to the genetic marker on the genetic map and 

the whole process is repeated for the other genetic maricers of the genetic map thereby 
integrating the genetic map with flie physical map. 

The method according to the invention has certain advantages over methods 
disclosed in the art. One of the advantages is that the resulting integrated map has high 

30 resolution due to the fingerprintiDg of the individual clones with primer-pairs of low 
selectivity. The amplification step of AFLP with low selective primers will result in the 
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amplification of more adapter-Ugated restriction fragments and consequently to more 
physical markers per clone as compared to similar fingerprinting with primers of hi^ 
selectivity. Due to this increased number of physical markers per clone the subsequent 
generation of a contig is also of a better quality and more clones can be correctly aligned 
5 resulting in an improved integration of the physical and genetic map. 

Another advantage is in the speed of the generation of the contig. With the method 
of the present invention relative small sets of individual clones are selected that are 
associated with the AFLP fragment of the genetic marker of which a contig is generated. 
Small sets have the advantage that the generation of a contig is significantly fester than 
10 when larger sets are used. After placement of the contigs in the genetic map, a more 
reUable integrated map is obtained by aligning the placed contigs and verifying the 
alignments with the position of flie genetic marker on the genetic map. One of the further 
advantages is that the relative genetic distance (generally in centimorgan) are now 
converted to physical distances (in basepairs) and may provide a correction or enhancement 

15 of the genetic m^, for instance by reversing the order of genetic markers or by more 

accurately determining the distance between markers. One of the most distinct advantages 
is that each step is verifiable and correctable. This means that in case an amplification or a 
detection, for instance on a gel, is not of the desired quaUty, this step can be immediately 
corrected until data of the desired quality are obtained. 

20 A significant advantage of the method of the invention over the methods known in 

the art is that the present method provides for the detection of non-polymoiphic markets 
and their subsequent integration in the integrated map. Another advantage associated with 
the method of the invention is that the generation of physical mapping data and linkage to 
liie genetic map are achieved directly &om the start of the method (for each marker 

25 separately). 

hi one embodiment the method can also be used for the Imking or integrating of a 
physical and genetic marker. The method titien comprises the steps of: 

(a) characterismg the genetic marker by means of at least one AFLP fiagment 
identified through AFLP fingerprinting; 
3® 0>) providing an artificial chromosome Ubrary such as a BAG or YAC; 
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(c) gen^ating a multitude of pools, each pool containing a multitude of artificial 
chromosomes from the library; 

(d) generatinganAFLP fingerprint for each of the pools; 

(f) selecting from the multitude of pools those individual pools in which the 
5 AETLP fragment associated with the genetic marker is present in the 

fingerprint of the individual pool; 

(g) generating an AFLP fingerprint of the individual artificial chromosomes from 
the pools identified under (f); 

(h) selecting from the fingerprints of the individual artificial chromosomes of 
10 stop (g) the individual artificial chromosomes in which the AFLP fragment 

associated with the genetic marker is present in the fingerprint; 

(i) generating a contig of the individual artificial chromosomes identified in step 

(h); 

(j) linking the contig obtained in step (i) to the genetic marker on the genetic 
1 5 m^ thereby obtaining a linkage between the physical marker and the genetic 

marker; 

wherein the forward and reverse AFLP primers used in step (b) and (f) comprise K 
respectively L selective nucleotides at the 3 '-end of the primer, wherein the forward and 
reverse AFLP primers used in step d comprise M respectively N selective nucleotides at the 
20 3 '-end of the primer, wherein K, L, M, N are integers from 0 to 1 0, and wherein K+L ^ 

In this embodiment the present invention is used for the single integration of a 
genetic marker witti a (group of) physical marker(s). By repeating these steps for all 
available genetic markers, the complete integrated map can be constructed, but in this 

25 embodiment tins is entirely optional. 

In one embodiment, the difference in the number of nucleotides that are used in the 
set of primers with low selectivity compared to the set of primers of high selectivity, also 
depicted as (K+L) - (M+N), is at least 1, preferably at least 2, more preferably at least 3, 
most preferably at least 4. In one embodiment, the number of selective nucleotides used in 

30 the set of primers of low selectivity, also depicted as is M+N, is at least 0, preferably at 
least 1, more preferably at least 2, most preferably at least 3, 
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The pools of the individual clones that are used in the metiiod of the present 
invention preferably contain between 0,001 and 1 graiome equivalent of the total genome to 
be analysed, preferably between 0.1 and 0.75, more preferably between 0.15 and 0.60, most 
preferably between 0.20 and 0.50. More preferred is between 0.25 and 0.35, more 
5 preferably 0,4, most preferably 0.3 . 

In general, a pool of individual clones is limited in the number of clones it can 
contain. In case of exceptionally large genomes, libraries or very small inserts, it maybe 
advantageously to introduce and additional pooling step. Therefore in a preferred 
embodiment of the invention, the method comprises an additional pooling step. The 
1 0 additional pooling step results in pools that can be fingerprinted using primers with the 
same (high) selectivity as are used for flie fingerprinting of the pools of the initial pooling 
step. It is also possible to use primers of a intermediate selectivity, that is, primers with a 
number of selective nucleotides (P+Q) between K+L and M+N such that K+L ^ P+Q ^ 
M+N. 

15 In an alternative embodiment, it is possible to avoid the use of pools of clones 

altogether by generating a contig of all individual clones that have been fingerprinted using 
primers with low selectivity. Optionally a pre-selection of the individual clones is made 
wherein the clones at least contain the fi-agment that is associated with the genetic marker. 
More in detail: the individual fingerprints obtained using primers of low selectivity are 

20 screened for the presence of the (fi-agment) marker associated with the genetic marker that 
has been identified using the primers with high selectivity. This embodiment is 
advantageously for instance when smaller genomes are investigated or when only a few 
genetic markers are integrated with the physical markers. In such cases there is no real need 
for a pooling step, 

25 In one embodiment to characterise all genetic markers available by AFLP it may be 

preferred to use different primers and/or primer combinations for different genetic markers. 
The primers and/or primer combinations for the fingerprinting of the pools and the 
individual clones can then to be ad£5)ted accordingly. It is even possible to use diJBferent 
enzyme combinations for different genetic markers. The associated contigs of BACs can 

30 be aligned based on the overlsqp of the BAG contigs and the occurrence of identical 
physical markers. This is schematically depicted in Fig 3. 
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The method of the invention is in principle smtable for tiie linkage of one or more 
physical markers to a genetic marker and for the linkage of phjreical markers to the gaietic 
map comprising a miiltitude of genetic markers. It is preferred tiiat the genetic markers are 
qjread evenly across flie m^. It is further preferred that the genetic map is a map that is 
5 regarded in the art as a genetic map with a high marker density. This facilitates the linkage 
of (the contigs of) the physical markers to the genetic map and enhances the quaUty and 
rehability of the resulting integrated physical and genetic map. Preferably the genetic map 
has an (average) density of at least 1 genetic marker per 1000 kb, preferably per 500 kb 
more preferably per 200 kb, most preferably per 100 kb. Particularly preferred is a genetic 
10 map with a density of at least 1 genetic marker per 50 kb. 

When all genetic markers are linked to a contig derived from the individual clones 
in the library, gaps may remain and/or clones may remain that have not been placed on the 
m^. Due to the increased density of the physical markers obtained by the fingerprinting of 
the individual clones with primers of low selectivity, the placement of these clones can be 
1 5 accomplished using conventional aligmnent software, and using techniques such as BAG 
walking and using the BAG end sequences. 

Jn another aspect, the present invention pertains to the use of AFLP primers for the 
integration of genetic and physical maps and to the use of AFLP and AFLP primers in 
linking genetic and physical genome maps. AFLP and AFLP primers have demonstrated to 
20 provide an excellent technique for the integration of the physical and genetic maps. 

In a flirther aspect the invention pertains to the use of a first and a second pair of 
AFLP primers in a method for linking genetic and physical genome maps wherein the first 
pair of AFLP primers comprise K respectively L selective nucleotides at the 3'-end of the 
primer, wherein the second pair of AFLP primers used comprise M respectively N selective 
25 nucleotides at the 3 '-end of the primer, wherein K, L, M, N are integers &om 0 to 10, and 
wherein K+L SM+N. 

By link i ng the physical contigs to the gaietic markers a Gist integrated map can be 
obtained. Based thereon further improvement, correction and refinement of the genetic map 
is possible by generating a contig of the various physical markers linked to a genetic 
30 marker. 
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The invention provides for the construcition of high resolution integrated genetic 
and physical of any genome without the need for prior knowledge of the sequence. 
The invention further feciKtates the efficient construction of detailed physical maps of, 
optionally pre-selected, regions of any genome. The invention fiirther facilitates the 
5 discovery of novel BAG derived (AFLP) markers, both polymorphic and non-polymoiphic 
for the integrated genetic map. 

Description of the Figures 
Figure 1 discloses a schematic representation of the integration of a physical and a genetic 
10 map. The method starts by identifying genetic markers using AFLP fingeiprinting. In this 
case a genetic marker was identified using HindSn. sadMsel as restriction endonucleases in 
the AFLP fingerprinting and Hind+2 and Mse+3 high selective primers carrying AT and 
CCA as selective nucleotides at their 3' end, respectively. The identified AFLP fiagment 
associated with the genetic marker has a length of 344 bp. A BAG Ubraiy is prepared 

15 conq)rising Z clones. The individual clones are pooled (pool I contains clones 1, 2 and 3; H 
contains 3, 5 and Z; and in contains 1,4, and 5) and the pools are fingerprinted using AFLP 
with identical enzymes and primers with the same selectivity as used in the identification 
of the genetic marker (Hind+2(AT) and Mse+3(CCA)). This positively identifies those 
pools that contain the fi:agment AT/CCA-344 (pools I and IH). From each clone present in 

20 the pool (pools I and ID) that contains the fiiagment of interest, (1 , 2, 3, 4, 5) an AFLP 

fingerprint is generated, using primers of lower selectivity, m this case +0/-fO. Each of the 
individual clones is fingerprinted and bands with 344 bp length can be identified ( in 2, 3, 
4, 5). These bands may very well be unrelated to the genetic marker of interest, as they also 
include any fragment that contains other nucleotides than the AT/CCA combination at the 

25 respective ends. From the individual fingerprints containing the 344 fragment the clones 
are selected in silico that are most likely to be linked to the AFLP fragment of interest This 
selection step is based on the conq)arison of one or more bands in tiie fingerprint pattems. 
The more corresponding bands are found m the fingerprints (the bands are depicted as 
$), the higher tiie likelihood that these fingerprints are able to form a contig and tiius afign 

30 the individual clones fix)m the Ubrary to form a contig. The individual clones that contain a 
fragment wifli a length of 344 bp and that are selected based on the in silico process of the 
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fingerprint patterns are aligned using FPC to form a contig. This contig can be placed on 
the genetic map. False positive fragments (i.e. that contain the fragment AT/CCA-344, but 
are not linked to the genetic marker of interest) will not form a contig under the conditions 
used. Repeating these steps for other genetic markers resiilts in the formation of an 
5 integrated physical and genetic map. 

Figure 2: A: Hindm-^l/Msel+S fingerprints of BAG pools of Pristionchus pacificus with 
the previously identified AFLP fragments characteristic of the genetic markers indicated by 
arrows. 

B: Hin6ni+0IMsel+0 fingerprints of individual BACs of corresponding positive pools in 
1 0 Fig 2A.and indication of the characteristic AFLP bands (BAG 1 and BAG2). 

C: Gorresponding HinSm+OIMsel+O BAG contig aud identification of the BAGl and 
BAG2 clones. 

Figure 3: Schematic representation of the method for the iutegration of physical and 
genetic maps, whereby for different genetic markers the linkage between the physical BAG 

15 clones and the respective genetic markers is obtained by fingerprinting the BAG pools and 
the individual fingerprints with different restriction enzyme combinations in the AFLP 
fingerprinting for the different genetic markers. A, B and C are genetic markers. REl and 
RE2 are different restriction enzyme combinations (for instance HindlH/Msel and 
EcoRI/Msel). 1-8 are BAG clones. 

20 A genetic marker A has been linked to physical BAG clones 1-4 using enzyme combination 
REl in the AFLP fingerprinting of the donorparents, the BAG pools and the individual 
BAGs according to the method of the invention. Genetic marker O has been mapped to BG 
clones 6, 9 and 10 using the same enzyme combination REl. Genetic markers X and G 
have similarly been mapped to BAG clones 2,5, 8 and 1,6,7 respectively, using enzyme 

25 combination RE2. Based on the common presence of BAG clones 1 and 2 in A and in B 
and G respectively, the genetic marker A identified with a different enzyme combination 
than genetic markers B and G can now be placed on the common integrated map. Based on 
the common presence of BAG clone 6 in O and G respectively, genetic marker T can be 
located adjacent to genetic marker G on the integrated map, thereby providing for an 

30 integrated map of genetic markers T, A, G and O that have been identified using different 
AFLP enzyme combinations. 
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Examples 

S BAC library construction 

High molecular weight DNA of the nematode Pristionchus pacificus was used to 
construct a BAC library. Partially Hin^SL digested DNA of Pristionchus pacificus was 
separated witii pulsed field gel electrophoresis and ligated into the pIhdigoBAC-536 vector. 
The ligated products were electroporated into electrocompetent DHIOB E, coli cells and 
10 the average insert size was determined on field inversion gel electrophoresis. The 

Pristionchus pacificus BAC library contains 13.500 clones with an all over average insert 
size of >125 kb. The estimated genome coverage of the library is estimated to be at least 7 
X, assuming a 200 Mb total genome size of Pristionchus pacificus. 

15 Tlie pooling of the BAC library 

The pooling of the BAC library was performed by pooling individual 96-well 
plates. Each 96-well plate was inoculated on a standard TY-agarplate with 
chloroamphenicol selection (12.5 mg/ltr), and incubated overnight at 37 Celsius. In tihe 
morning the 96 colonies were resuspended in 1 ml of hquid TY medium and collected 
20 from the agarplates. A standard miniprep procedure was performed and the pooled BAC 
DNA was used for AFLP template preparation. 

DNA preparation of individual BACs and BAC pools 

DNA fi"om individual BACs was prepared according to standard procedures 
25 (Sambrook, J., Fritsch, B.F., and Maniatis, T., in Molecular Cloning: A Laboratory Manual. 
Cold Spring Harbor Laboratory Press, NY, Vol. 1, 2, 3 (1989).) An overnight 1.5 ml 
culture in TY medium and chloroamphenicol (12.5 mg/ltr) selection generates sufficient 
DNA for AFLP template preparation. 

30 AFLP analysis of individual BACs and BAC pools 
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AFLP templates of individual BAG DNA, the BAG pools and the two donoiparents 
that were xised in the genetic mapping population were prepared according to standard 
procedures (Vos et al Nucleic Acids Research (1995), 23, 4407-4414 ) using Hin6m and 
A£rel as the enzyme combination. 

5 

For the -H)/+0 ampUfication of the individual BAG fragments, no preamplification 
step was necessary. The +0/+0 amplification step was directly performed on the 1:10- 
diluted template of the individual BAG DNA. The amphfied +07+0 fragments were 
detected and sized on a MegaBAGE 1000 capillary electrophoresis platform. 

10 AFLP reactions on the BAG pools were preceded by +1/+1 preamphfications with 

one selective nucleotide on each primer. AFLP reactions were performed with primer 
combinations having 2 selective nucleotides on the Hindm primer and 3 selective 
nucleotides on the Msel primer. 

The amplified +21+3 AFLP fiagments were detected and sized on the same 

15 MegaBAGE 1 000 capiUary electrophoresis platform as the +0/+0 fingerprints. 

AFLP adaptor orimers 

The following primers were used to generate the fingerprints of the BAG clones 
91M35 : ^mdni-adaptor primer forward (5'-3') : GTGGTAGAGTGGGTAGG 
20 9 1M37 : /fmdlll-adaptor primer reverse (5 '-3 ') : AGGTGGTAGGGAGTGTAG 

92A1 8 : Myel-adaptor primer forward (5 '-3 ') : GAGGATGAGTGGTGAG 
92A19 : Msel-adaptor primer reverse (5 '-3 ') : TAGTGAGGAGTGAT 

Non selective primer sequences fS*-3») 
25 93Q34:/fi7»din+0 primer (5'.3') : GAGTGGGTAGGAGGTT 

93E40:MreI-K) primer (5'-3*) : GATGAGTGGTGAGTAA 

AFLP selecti ve primer sequences fSVl*) 

Hin6m+2 primers can be labeled with FAM, JOE or NED for detection on MegaBAGE 
30 platform. 

Hinm. +2 primers (5'-3') : 
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Constant sequence: GACTGCGTACCAGCTT 
Selective nucleotides: 



AA 


CA 


GA 


TA 


AC 


CC 


GC 


TC 


AG 


CG 


GG 


TG 


AT 


CT 


GT 


TT 



Afeel +3 primers (5'-3'): 

Constant sequence: GATGAGTCCTGAGTAA 
5 Selective nucleotides: 



AAA 


TAA 


CAA 


GAA 


AGA 


TGA 


CGA 


GGA 


AAC 


TAC 


CAC 


GAC 


AGC 


TGC 


CGC 


GGC 


AAT 


TAT 


CAT 


GAT 


AGT 


TGT 


CGT 


GGT 


AAG 


TAG 


CAG 


GAG 


AGG 


TGG 


CGG 


GGG 


ACA 


TCA 


CCA 


GCA 


ATA 


TTA 


CTA 


GTA 


ACC 


TCC 


CCC 


GCC 


ATC 


TTC 


CTC 


GTC 


ACT 


TCT 


CCT 


GCT 


ATT 


TTT 


CTT 


GTT 


ACG 


TCG 


CCG 


GCG 


ATG 


TTG 


CTG 


GTG 



Software analysis 

All MegaBACE traces were analysed by in-house developed software and the 
resulting datasets of the TKndlU+O / ik&el+O individual fingerprints were processed by FPC 

10 software (Soderlund et al. vide supra) The Hmdin+2 / AfeeH-3 AFLP fingerprints of the 
pooled BAG DNAs and the donorparents of the genetic mapping population were also 
analysed by in-house software. The link of the iKndIII+2 / AfeeI+3 firagment to the 
^mdm+O / Afeel+O firagment in the corresponding individual BAG fingerprint was made in 
silico by searching firagments of comparable length within a certain error margin. The 

15 contigs were generated with a cut-off parameter set at various values between 10"'^ and 10"^"^ 
and the resulting contig evaluated. Tolerance was set at 2-4. ( i.e. 0.2-0.4 basepairs) 



Results and conclusions 
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Genetic markers have been linked to BAG contigs in a very efficient way and 
multiple genetic markers have been combined into an integrated physical and genetic map. 
The concept is applicable to different species with varying genome sizes with success, and 
without prior knowledge of the specific genomic sequence. The method provides for the 
5 effective integration of physical and genetic maps and will have an important impact in the 
genonodc arena. 



